Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add filters

Language
Document Type
Year range
1.
biorxiv; 2024.
Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2024.02.07.579374

ABSTRACT

Public health researchers and practitioners commonly infer phylogenies from viral genome sequences to understand transmission dynamics and identify clusters of genetically-related samples. However, viruses that reassort or recombine violate phylogenetic assumptions and require more sophisticated methods. Even when phylogenies are appropriate, they can be unnecessary or difficult to interpret without specialty knowledge. For example, pairwise distances between sequences can be enough to identify clusters of related samples or assign new samples to existing phylogenetic clusters. In this work, we tested whether dimensionality reduction methods could capture known genetic groups within two human pathogenic viruses that cause substantial human morbidity and mortality and frequently reassort or recombine, respectively: seasonal influenza A/H3N2 and SARS-CoV-2. We applied principal component analysis (PCA), multidimensional scaling (MDS), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP) to sequences with well-defined phylogenetic clades and either reassortment (H3N2) or recombination (SARS-CoV-2). For each low-dimensional embedding of sequences, we calculated the correlation between pairwise genetic and Euclidean distances in the embedding and applied a hierarchical clustering method to identify clusters in the embedding. We measured the accuracy of clusters compared to previously defined phylogenetic clades, reassortment clusters, or recombinant lineages. We found that MDS maintained the strongest correlation between pairwise genetic and Euclidean distances between sequences and best captured the intermediate placement of recombinant lineages between parental lineages. Clusters from t-SNE most accurately recapitulated known phylogenetic clades and recombinant lineages. Both MDS and t-SNE accurately identified reassortment groups. We show that simple statistical methods without a biological model can accurately represent known genetic relationships for relevant human pathogenic viruses. Our open source implementation of these methods for analysis of viral genome sequences can be easily applied when phylogenetic methods are either unnecessary or inappropriate.

2.
medrxiv; 2022.
Preprint in English | medRxiv | ID: ppzbmed-10.1101.2022.09.02.22279519

ABSTRACT

Background: Causes of non-malarial fevers in sub-Saharan Africa remain understudied. We hypothesized that metagenomic next-generation sequencing (mNGS), which allows for broad genomic-level detection of infectious agents in a biological sample, can systematically identify potential causes of non-malarial fevers. Methods and Findings: The 212 participants in this study were of all ages and were enrolled in a longitudinal malaria cohort in eastern Uganda. Between December 2020 and August 2021, respiratory swabs and plasma samples were collected at 313 study visits where participants presented with fever and were negative for malaria by microscopy. Samples were analyzed using CZ ID, a web-based platform for microbial detection in mNGS data. Overall, viral pathogens were detected at 123 of 313 visits (39%). SARS-CoV-2 was detected at 11 visits, from which full viral genomes were recovered from nine. Other prevalent viruses included Influenza A (14 visits), RSV (12 visits), and three of the four strains of seasonal coronaviruses (6 visits). Notably, 11 influenza cases occurred between May and July 2021, coinciding with when the Delta variant of SARS-CoV-2 was circulating in this population. The primary limitation of this study is that we were unable to estimate the contribution of bacterial microbes to non-malarial fevers, due to the difficulty of distinguishing bacterial microbes that were pathogenic from those that were commensal or contaminants. Conclusions: These results revealed the co-circulation of multiple viral pathogens likely associated with fever in the cohort during this time period. This study illustrates the utility of mNGS in elucidating the multiple causes of non-malarial febrile illness. A better understanding of the pathogen landscape in different settings and age groups could aid in informing diagnostics, case management, and public health surveillance systems.


Subject(s)
Malaria , Euthyroid Sick Syndromes , Fever
3.
medrxiv; 2021.
Preprint in English | medRxiv | ID: ppzbmed-10.1101.2021.09.21.21258385

ABSTRACT

During the COVID-19 pandemic within the United States, much of the responsibility for diagnostic testing and epidemiologic response has relied on the action of county-level departments of public health. Here we describe the integration of genomic surveillance into epidemiologic response within Humboldt County, a rural county in northwest California. Through a collaborative effort, 853 whole SARS-CoV-2 genomes were generated, representing [~]58% of the 1,449 SARS-CoV-2-positive cases detected in Humboldt County as of mid-March 2021. Phylogenetic analysis of these data was used to develop a comprehensive understanding of SARS-CoV-2 introductions to the county and to support contact tracing and epidemiologic investigations of all large outbreaks in the county. In the case of an outbreak on a commercial farm, viral genomic data were used to validate reported epidemiologic links and link additional cases within the community who did not report a farm exposure to the outbreak. During a separate outbreak within a skilled nursing facility, genomic surveillance data were used to rule out the putative index case, detect the emergence of an independent Spike:N501Y substitution, and verify that the outbreak had been brought under control. These use cases demonstrate how developing genomic surveillance capacity within local public health departments can support timely and responsive deployment of genomic epidemiology for surveillance and outbreak response based on local needs and priorities.


Subject(s)
COVID-19
4.
preprints.org; 2020.
Preprint in English | PREPRINT-PREPRINTS.ORG | ID: ppzbmed-10.20944.preprints202008.0220.v1

ABSTRACT

The Public Health Alliance for Genomic Epidemiology (PHA4GE) (https://pha4ge.org) is a global coalition that is actively working to establish consensus standards, document and share best practices, improve the availability of critical bioinformatic tools and resources, and advocate for greater openness, interoperability, accessibility and reproducibility in public health microbial bioinformatics. In the face of the current pandemic, PHA4GE has identified a clear and present need for a fit-for-purpose, open source SARS-CoV-2 contextual data standard. As such, we have developed an extension to the INSDC pathogen package, providing a SARS-CoV-2 contextual data specification based on harmonisable, publicly available, community standards. The specification is implementable via a collection template, as well as an array of protocols and tools to support the harmonisation and submission of sequence data and contextual information to public repositories. Well-structured, rich contextual data adds value, promotes reuse, and enables aggregation and integration of disparate data sets. Adoption of the proposed standard and practices will better enable interoperability between datasets and systems, improve the consistency and utility of generated data, and ultimately facilitate novel insights and discoveries in SARS-CoV-2 and COVID-19.


Subject(s)
COVID-19 , Fractures, Open
SELECTION OF CITATIONS
SEARCH DETAIL